Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 3670 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1273 |
| Duplicate rows (%) | 34.7% |
| Total size in memory | 688.3 KiB |
| Average record size in memory | 192.0 B |
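The derived figures in this table can be checked from the raw ones. The following is a minimal sketch that recomputes the duplicate-row percentage and the average record size from the values reported above; the variable names are illustrative and not part of the profiler's output.

```python
# Figures copied from the Dataset statistics table.
n_rows = 3670
n_duplicates = 1273
total_kib = 688.3

# Share of rows flagged as duplicates, in percent.
duplicate_pct = 100 * n_duplicates / n_rows

# Average bytes per record: total size (KiB -> bytes) divided by row count.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the reported 34.7% and 192.0 B after rounding to one decimal place.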
Variable types
| Text | 14 |
|---|---|
| Categorical | 10 |
Alerts
| Dataset has 1273 (34.7%) duplicate rows | Duplicates |
| X10 is highly overall correlated with X11 and 8 other fields | High correlation |
| X11 is highly overall correlated with X10 and 6 other fields | High correlation |
| X2 is highly overall correlated with X10 and 8 other fields | High correlation |
| X3 is highly overall correlated with X10 and 8 other fields | High correlation |
| X4 is highly overall correlated with X10 and 8 other fields | High correlation |
| X6 is highly overall correlated with X10 and 7 other fields | High correlation |
| X7 is highly overall correlated with X10 and 7 other fields | High correlation |
| X8 is highly overall correlated with X10 and 8 other fields | High correlation |
| X9 is highly overall correlated with X10 and 8 other fields | High correlation |
| Y is highly overall correlated with X10 and 8 other fields | High correlation |
| X4 is highly imbalanced (52.0%) | Imbalance |
| Y is highly imbalanced (52.0%) | Imbalance |
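The 52.0% imbalance figure reported for X4 is consistent with an entropy-based score: 1 minus the Shannon entropy of the class frequencies divided by its maximum, log2 of the number of classes. The sketch below is an assumption about how the score is derived, not a copy of the profiler's code; `imbalance_score` is a hypothetical helper, and the counts are X4's value counts as listed in this report.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one
    class dominates. Assumed formula, see the lead-in above.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)


# X4 value counts from this report: 2, 1, 3, 0, and the stray label MARRIAGE.
x4_counts = [2045, 1559, 54, 10, 2]
score = imbalance_score(x4_counts)
print(f"{score:.1%}")
```

Under this assumption the three rare values (54, 10, and 2 rows) drive the score, since the two dominant classes alone are close to balanced.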
Reproduction
| Analysis started | 2023-12-10 20:30:52.285511 |
|---|---|
| Analysis finished | 2023-12-10 20:30:54.985675 |
| Duration | 2.7 seconds |
| Software version | ydata-profiling v4.6.3 |
| Download configuration | config.json |
X1
Text
| Distinct | 63 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.6168937 |
| Min length | 5 |
Characters and Unicode
| Total characters | 20614 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | LIMIT_BAL |
|---|---|
| 2nd row | 20000 |
| 3rd row | 120000 |
| 4th row | 90000 |
| 5th row | 50000 |
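The first sampled value of X1 is the label `LIMIT_BAL` rather than a number, which suggests a row of descriptive column labels was read as data; this would also explain why numeric-looking fields are typed as Text or Categorical. A minimal sketch of one way to separate such rows before profiling, using only the standard library, is given here. It assumes the second line of the file holds the labels and that any later verbatim repeat of that line should be dropped; `split_label_rows` and the three-column excerpt are hypothetical, with values mirroring the samples in this report.

```python
import csv
import io


def split_label_rows(text):
    """Split CSV text into (header, labels, data_rows).

    Assumes row 1 is the generic header (X1, X2, ...), row 2 is the
    descriptive label row, and any later identical label row is noise.
    """
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    labels = next(reader)
    rows = [row for row in reader if row != labels]
    return header, labels, rows


# Hypothetical excerpt; the position of the repeated label row is made up.
raw = (
    "X1,X2,X5\n"
    "LIMIT_BAL,SEX,AGE\n"
    "20000,female,24\n"
    "120000,female,26\n"
    "LIMIT_BAL,SEX,AGE\n"
    "90000,female,34\n"
    "50000,female,37\n"
)

header, labels, rows = split_label_rows(raw)
limit_bal = [int(row[0]) for row in rows]  # numeric parsing now succeeds
print(labels, limit_bal)
```

With the label rows removed, columns such as X1 and X5 can be parsed as integers and profiled as numeric variables.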
Words
| Value | Count | Frequency (%) |
| 50000 | 453 | 12.3% |
| 20000 | 236 | 6.4% |
| 30000 | 191 | 5.2% |
| 200000 | 182 | 5.0% |
| 80000 | 165 | 4.5% |
| 180000 | 135 | 3.7% |
| 360000 | 122 | 3.3% |
| 100000 | 118 | 3.2% |
| 140000 | 117 | 3.2% |
| 150000 | 115 | 3.1% |
| Other values (53) | 1836 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15154 | |
| 1 | 1189 | 5.8% |
| 2 | 1174 | 5.7% |
| 3 | 783 | 3.8% |
| 5 | 743 | 3.6% |
| 8 | 391 | 1.9% |
| 6 | 385 | 1.9% |
| 4 | 381 | 1.8% |
| 7 | 213 | 1.0% |
| 9 | 183 | 0.9% |
| Other values (7) | 18 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20596 | |
| Uppercase Letter | 16 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
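The category names in this table correspond to Unicode general categories (for example `Nd` for Decimal Number and `Pc` for Connector Punctuation). As a rough illustration of how such a tally can be produced, the sketch below counts characters per category with Python's `unicodedata` module. `category_counts` and the name mapping are hypothetical helpers, and the input is just the five sample values of X1, not the full column.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Pc": "Connector Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters across all strings, grouped by Unicode category."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample = ["LIMIT_BAL", "20000", "120000", "90000", "50000"]
counts = category_counts(sample)
print(counts.most_common())
```

On the sample values this yields 21 decimal digits, 8 uppercase letters, and 1 connector punctuation character (the underscore).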
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15154 | |
| 1 | 1189 | 5.8% |
| 2 | 1174 | 5.7% |
| 3 | 783 | 3.8% |
| 5 | 743 | 3.6% |
| 8 | 391 | 1.9% |
| 6 | 385 | 1.9% |
| 4 | 381 | 1.8% |
| 7 | 213 | 1.0% |
| 9 | 183 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| I | 4 | |
| M | 2 | |
| T | 2 | |
| B | 2 | |
| A | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20598 | |
| Latin | 16 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15154 | |
| 1 | 1189 | 5.8% |
| 2 | 1174 | 5.7% |
| 3 | 783 | 3.8% |
| 5 | 743 | 3.6% |
| 8 | 391 | 1.9% |
| 6 | 385 | 1.9% |
| 4 | 381 | 1.8% |
| 7 | 213 | 1.0% |
| 9 | 183 | 0.9% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| I | 4 | |
| M | 2 | |
| T | 2 | |
| B | 2 | |
| A | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20614 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15154 | |
| 1 | 1189 | 5.8% |
| 2 | 1174 | 5.7% |
| 3 | 783 | 3.8% |
| 5 | 743 | 3.6% |
| 8 | 391 | 1.9% |
| 6 | 385 | 1.9% |
| 4 | 381 | 1.8% |
| 7 | 213 | 1.0% |
| 9 | 183 | 0.9% |
| Other values (7) | 18 | 0.1% |
X2
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.160218 |
| Min length | 3 |
Characters and Unicode
| Total characters | 18938 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SEX |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | female |
Common Values
| Value | Count | Frequency (%) |
| female | 2130 | 58.0% |
| male | 1538 | 41.9% |
| SEX | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| female | 2130 | |
| male | 1538 | |
| sex | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5798 | |
| m | 3668 | |
| a | 3668 | |
| l | 3668 | |
| f | 2130 | 11.2% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| X | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18932 | |
| Uppercase Letter | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5798 | |
| m | 3668 | |
| a | 3668 | |
| l | 3668 | |
| f | 2130 | 11.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| E | 2 | |
| X | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18938 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5798 | |
| m | 3668 | |
| a | 3668 | |
| l | 3668 | |
| f | 2130 | 11.2% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| X | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18938 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5798 | |
| m | 3668 | |
| a | 3668 | |
| l | 3668 | |
| f | 2130 | 11.2% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| X | 2 | < 0.1% |
X3
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 12.033787 |
| Min length | 5 |
Characters and Unicode
| Total characters | 44164 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EDUCATION |
|---|---|
| 2nd row | university |
| 3rd row | university |
| 4th row | university |
| 5th row | university |
Common Values
| Value | Count | Frequency (%) |
| university | 1644 | 44.8% |
| graduate school | 1401 | 38.2% |
| high school | 596 | 16.2% |
| other | 27 | 0.7% |
| EDUCATION | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| school | 1997 | |
| university | 1644 | |
| graduate | 1401 | |
| high | 596 | 10.5% |
| other | 27 | 0.5% |
| education | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4021 | 9.1% |
| i | 3884 | 8.8% |
| s | 3641 | 8.2% |
| h | 3216 | 7.3% |
| e | 3072 | 7.0% |
| r | 3072 | 7.0% |
| t | 3072 | 7.0% |
| u | 3045 | 6.9% |
| a | 2802 | 6.3% |
| (space) | 1997 | 4.5% |
| Other values (16) | 12342 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42149 | |
| Space Separator | 1997 | 4.5% |
| Uppercase Letter | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4021 | 9.5% |
| i | 3884 | 9.2% |
| s | 3641 | 8.6% |
| h | 3216 | 7.6% |
| e | 3072 | 7.3% |
| r | 3072 | 7.3% |
| t | 3072 | 7.3% |
| u | 3045 | 7.2% |
| a | 2802 | 6.6% |
| l | 1997 | 4.7% |
| Other values (6) | 10327 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2 | |
| D | 2 | |
| U | 2 | |
| C | 2 | |
| A | 2 | |
| T | 2 | |
| I | 2 | |
| O | 2 | |
| N | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1997 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42167 | |
| Common | 1997 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 4021 | 9.5% |
| i | 3884 | 9.2% |
| s | 3641 | 8.6% |
| h | 3216 | 7.6% |
| e | 3072 | 7.3% |
| r | 3072 | 7.3% |
| t | 3072 | 7.3% |
| u | 3045 | 7.2% |
| a | 2802 | 6.6% |
| l | 1997 | 4.7% |
| Other values (15) | 10345 |
Common
| Value | Count | Frequency (%) |
| (space) | 1997 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44164 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4021 | 9.1% |
| i | 3884 | 8.8% |
| s | 3641 | 8.2% |
| h | 3216 | 7.3% |
| e | 3072 | 7.0% |
| r | 3072 | 7.0% |
| t | 3072 | 7.0% |
| u | 3045 | 6.9% |
| a | 2802 | 6.3% |
| (space) | 1997 | 4.5% |
| Other values (16) | 12342 |
X4
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 1 |
| Mean length | 1.0038147 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3684 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MARRIAGE |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 2045 | 55.7% |
| 1 | 1559 | 42.5% |
| 3 | 54 | 1.5% |
| 0 | 10 | 0.3% |
| MARRIAGE | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| 2 | 2045 | |
| 1 | 1559 | |
| 3 | 54 | 1.5% |
| 0 | 10 | 0.3% |
| marriage | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2045 | |
| 1 | 1559 | |
| 3 | 54 | 1.5% |
| 0 | 10 | 0.3% |
| A | 4 | 0.1% |
| R | 4 | 0.1% |
| M | 2 | 0.1% |
| I | 2 | 0.1% |
| G | 2 | 0.1% |
| E | 2 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3668 | |
| Uppercase Letter | 16 | 0.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 4 | |
| M | 2 | |
| I | 2 | |
| G | 2 | |
| E | 2 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2045 | |
| 1 | 1559 | |
| 3 | 54 | 1.5% |
| 0 | 10 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3668 | |
| Latin | 16 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| R | 4 | |
| M | 2 | |
| I | 2 | |
| G | 2 | |
| E | 2 |
Common
| Value | Count | Frequency (%) |
| 2 | 2045 | |
| 1 | 1559 | |
| 3 | 54 | 1.5% |
| 0 | 10 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2045 | |
| 1 | 1559 | |
| 3 | 54 | 1.5% |
| 0 | 10 | 0.3% |
| A | 4 | 0.1% |
| R | 4 | 0.1% |
| M | 2 | 0.1% |
| I | 2 | 0.1% |
| G | 2 | 0.1% |
| E | 2 | 0.1% |
X5
Text
| Distinct | 53 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.000545 |
| Min length | 2 |
Characters and Unicode
| Total characters | 7342 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | AGE |
|---|---|
| 2nd row | 24 |
| 3rd row | 26 |
| 4th row | 34 |
| 5th row | 37 |
Words
| Value | Count | Frequency (%) |
| 29 | 214 | 5.8% |
| 27 | 185 | 5.0% |
| 30 | 174 | 4.7% |
| 26 | 158 | 4.3% |
| 24 | 155 | 4.2% |
| 32 | 152 | 4.1% |
| 34 | 151 | 4.1% |
| 28 | 147 | 4.0% |
| 31 | 145 | 4.0% |
| 35 | 135 | 3.7% |
| Other values (43) | 2054 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1760 | |
| 2 | 1584 | |
| 4 | 1159 | |
| 5 | 649 | 8.8% |
| 6 | 426 | 5.8% |
| 7 | 408 | 5.6% |
| 9 | 387 | 5.3% |
| 0 | 334 | 4.5% |
| 8 | 332 | 4.5% |
| 1 | 297 | 4.0% |
| Other values (3) | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7336 | |
| Uppercase Letter | 6 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1760 | |
| 2 | 1584 | |
| 4 | 1159 | |
| 5 | 649 | 8.8% |
| 6 | 426 | 5.8% |
| 7 | 408 | 5.6% |
| 9 | 387 | 5.3% |
| 0 | 334 | 4.6% |
| 8 | 332 | 4.5% |
| 1 | 297 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| G | 2 | |
| E | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7336 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1760 | |
| 2 | 1584 | |
| 4 | 1159 | |
| 5 | 649 | 8.8% |
| 6 | 426 | 5.8% |
| 7 | 408 | 5.6% |
| 9 | 387 | 5.3% |
| 0 | 334 | 4.6% |
| 8 | 332 | 4.5% |
| 1 | 297 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| A | 2 | |
| G | 2 | |
| E | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7342 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1760 | |
| 2 | 1584 | |
| 4 | 1159 | |
| 5 | 649 | 8.8% |
| 6 | 426 | 5.8% |
| 7 | 408 | 5.6% |
| 9 | 387 | 5.3% |
| 0 | 334 | 4.5% |
| 8 | 332 | 4.5% |
| 1 | 297 | 4.0% |
| Other values (3) | 6 | 0.1% |
X6
Categorical
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.2912807 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4739 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PAY_0 |
|---|---|
| 2nd row | 2 |
| 3rd row | -1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1741 | 47.4% |
| -1 | 786 | 21.4% |
| 1 | 495 | 13.5% |
| 2 | 327 | 8.9% |
| -2 | 275 | 7.5% |
| 3 | 25 | 0.7% |
| 4 | 9 | 0.2% |
| 8 | 9 | 0.2% |
| PAY_0 | 2 | 0.1% |
| 7 | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 1741 | |
| 1 | 1281 | |
| 2 | 602 | 16.4% |
| 3 | 25 | 0.7% |
| 4 | 9 | 0.2% |
| 8 | 9 | 0.2% |
| pay_0 | 2 | 0.1% |
| 7 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1743 | |
| 1 | 1281 | |
| - | 1061 | |
| 2 | 602 | 12.7% |
| 3 | 25 | 0.5% |
| 4 | 9 | 0.2% |
| 8 | 9 | 0.2% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| Other values (2) | 3 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3670 | |
| Dash Punctuation | 1061 | 22.4% |
| Uppercase Letter | 6 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1743 | |
| 1 | 1281 | |
| 2 | 602 | 16.4% |
| 3 | 25 | 0.7% |
| 4 | 9 | 0.2% |
| 8 | 9 | 0.2% |
| 7 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1061 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4733 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1743 | |
| 1 | 1281 | |
| - | 1061 | |
| 2 | 602 | 12.7% |
| 3 | 25 | 0.5% |
| 4 | 9 | 0.2% |
| 8 | 9 | 0.2% |
| _ | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4739 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1743 | |
| 1 | 1281 | |
| - | 1061 | |
| 2 | 602 | 12.7% |
| 3 | 25 | 0.5% |
| 4 | 9 | 0.2% |
| 8 | 9 | 0.2% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| Other values (2) | 3 | 0.1% |
X7
Categorical
HIGH CORRELATION 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.3370572 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4907 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAY_2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1901 | 51.8% |
| -1 | 787 | 21.4% |
| 2 | 487 | 13.3% |
| -2 | 442 | 12.0% |
| 3 | 31 | 0.8% |
| 7 | 9 | 0.2% |
| 4 | 4 | 0.1% |
| 1 | 3 | 0.1% |
| PAY_2 | 2 | 0.1% |
| 5 | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 1901 | |
| 2 | 929 | |
| 1 | 790 | |
| 3 | 31 | 0.8% |
| 7 | 9 | 0.2% |
| 4 | 4 | 0.1% |
| pay_2 | 2 | 0.1% |
| 5 | 2 | 0.1% |
| 6 | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1901 | |
| - | 1229 | |
| 2 | 931 | |
| 1 | 790 | |
| 3 | 31 | 0.6% |
| 7 | 9 | 0.2% |
| 4 | 4 | 0.1% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| Other values (3) | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3670 | |
| Dash Punctuation | 1229 | 25.0% |
| Uppercase Letter | 6 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1901 | |
| 2 | 931 | |
| 1 | 790 | |
| 3 | 31 | 0.8% |
| 7 | 9 | 0.2% |
| 4 | 4 | 0.1% |
| 5 | 2 | 0.1% |
| 6 | 2 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1229 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4901 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1901 | |
| - | 1229 | |
| 2 | 931 | |
| 1 | 790 | |
| 3 | 31 | 0.6% |
| 7 | 9 | 0.2% |
| 4 | 4 | 0.1% |
| _ | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 2 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4907 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1901 | |
| - | 1229 | |
| 2 | 931 | |
| 1 | 790 | |
| 3 | 31 | 0.6% |
| 7 | 9 | 0.2% |
| 4 | 4 | 0.1% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Y | 2 | < 0.1% |
| Other values (3) | 6 | 0.1% |
X8
Categorical
HIGH CORRELATION 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.3487738 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4950 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAY_3 |
|---|---|
| 2nd row | -1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1875 | 51.1% |
| -1 | 797 | 21.7% |
| -2 | 475 | 12.9% |
| 2 | 468 | 12.8% |
| 4 | 14 | 0.4% |
| 3 | 11 | 0.3% |
| 7 | 10 | 0.3% |
| 6 | 9 | 0.2% |
| 5 | 6 | 0.2% |
| 1 | 3 | 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 1875 | |
| 2 | 943 | |
| 1 | 800 | |
| 4 | 14 | 0.4% |
| 3 | 11 | 0.3% |
| 7 | 10 | 0.3% |
| 6 | 9 | 0.2% |
| 5 | 6 | 0.2% |
| pay_3 | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1875 | |
| - | 1272 | |
| 2 | 943 | |
| 1 | 800 | |
| 4 | 14 | 0.3% |
| 3 | 13 | 0.3% |
| 7 | 10 | 0.2% |
| 6 | 9 | 0.2% |
| 5 | 6 | 0.1% |
| P | 2 | < 0.1% |
| Other values (3) | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3670 | |
| Dash Punctuation | 1272 | 25.7% |
| Uppercase Letter | 6 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1875 | |
| 2 | 943 | |
| 1 | 800 | |
| 4 | 14 | 0.4% |
| 3 | 13 | 0.4% |
| 7 | 10 | 0.3% |
| 6 | 9 | 0.2% |
| 5 | 6 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1272 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4944 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1875 | |
| - | 1272 | |
| 2 | 943 | |
| 1 | 800 | |
| 4 | 14 | 0.3% |
| 3 | 13 | 0.3% |
| 7 | 10 | 0.2% |
| 6 | 9 | 0.2% |
| 5 | 6 | 0.1% |
| _ | 2 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1875 | |
| - | 1272 | |
| 2 | 943 | |
| 1 | 800 | |
| 4 | 14 | 0.3% |
| 3 | 13 | 0.3% |
| 7 | 10 | 0.2% |
| 6 | 9 | 0.2% |
| 5 | 6 | 0.1% |
| P | 2 | < 0.1% |
| Other values (3) | 6 | 0.1% |
X9
Categorical
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.353951 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4969 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PAY_4 |
|---|---|
| 2nd row | -1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1995 | 54.4% |
| -1 | 753 | 20.5% |
| -2 | 538 | 14.7% |
| 2 | 322 | 8.8% |
| 3 | 29 | 0.8% |
| 5 | 12 | 0.3% |
| 4 | 9 | 0.2% |
| 7 | 9 | 0.2% |
| PAY_4 | 2 | 0.1% |
| 6 | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 1995 | |
| 2 | 860 | |
| 1 | 753 | 20.5% |
| 3 | 29 | 0.8% |
| 5 | 12 | 0.3% |
| 4 | 9 | 0.2% |
| 7 | 9 | 0.2% |
| pay_4 | 2 | 0.1% |
| 6 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1995 | |
| - | 1291 | |
| 2 | 860 | |
| 1 | 753 | 15.2% |
| 3 | 29 | 0.6% |
| 5 | 12 | 0.2% |
| 4 | 11 | 0.2% |
| 7 | 9 | 0.2% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Other values (3) | 5 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3670 | |
| Dash Punctuation | 1291 | 26.0% |
| Uppercase Letter | 6 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1995 | |
| 2 | 860 | |
| 1 | 753 | 20.5% |
| 3 | 29 | 0.8% |
| 5 | 12 | 0.3% |
| 4 | 11 | 0.3% |
| 7 | 9 | 0.2% |
| 6 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1291 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4963 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1995 | |
| - | 1291 | |
| 2 | 860 | |
| 1 | 753 | 15.2% |
| 3 | 29 | 0.6% |
| 5 | 12 | 0.2% |
| 4 | 11 | 0.2% |
| 7 | 9 | 0.2% |
| _ | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4969 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1995 | |
| - | 1291 | |
| 2 | 860 | |
| 1 | 753 | 15.2% |
| 3 | 29 | 0.6% |
| 5 | 12 | 0.2% |
| 4 | 11 | 0.2% |
| 7 | 9 | 0.2% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Other values (3) | 5 | 0.1% |
X10
Categorical
HIGH CORRELATION 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.3553134 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4974 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAY_5 |
|---|---|
| 2nd row | -2 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1996 | 54.4% |
| -1 | 749 | 20.4% |
| -2 | 547 | 14.9% |
| 2 | 328 | 8.9% |
| 3 | 18 | 0.5% |
| 4 | 18 | 0.5% |
| 7 | 10 | 0.3% |
| PAY_5 | 2 | 0.1% |
| 5 | 2 | 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 1996 | |
| 2 | 875 | |
| 1 | 749 | 20.4% |
| 3 | 18 | 0.5% |
| 4 | 18 | 0.5% |
| 7 | 10 | 0.3% |
| pay_5 | 2 | 0.1% |
| 5 | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1996 | |
| - | 1296 | |
| 2 | 875 | |
| 1 | 749 | 15.1% |
| 3 | 18 | 0.4% |
| 4 | 18 | 0.4% |
| 7 | 10 | 0.2% |
| 5 | 4 | 0.1% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Other values (2) | 4 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3670 | |
| Dash Punctuation | 1296 | 26.1% |
| Uppercase Letter | 6 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1996 | |
| 2 | 875 | |
| 1 | 749 | 20.4% |
| 3 | 18 | 0.5% |
| 4 | 18 | 0.5% |
| 7 | 10 | 0.3% |
| 5 | 4 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1296 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4968 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1996 | |
| - | 1296 | |
| 2 | 875 | |
| 1 | 749 | 15.1% |
| 3 | 18 | 0.4% |
| 4 | 18 | 0.4% |
| 7 | 10 | 0.2% |
| 5 | 4 | 0.1% |
| _ | 2 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4974 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1996 | |
| - | 1296 | |
| 2 | 875 | |
| 1 | 749 | 15.1% |
| 3 | 18 | 0.4% |
| 4 | 18 | 0.4% |
| 7 | 10 | 0.2% |
| 5 | 4 | 0.1% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Other values (2) | 4 | 0.1% |
X11
Categorical
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.3817439 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5071 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PAY_6 |
|---|---|
| 2nd row | -2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1879 | 51.2% |
| -1 | 810 | 22.1% |
| -2 | 583 | 15.9% |
| 2 | 347 | 9.5% |
| 3 | 30 | 0.8% |
| 6 | 8 | 0.2% |
| 7 | 6 | 0.2% |
| 4 | 4 | 0.1% |
| PAY_6 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 1879 | |
| 2 | 930 | |
| 1 | 810 | |
| 3 | 30 | 0.8% |
| 6 | 8 | 0.2% |
| 7 | 6 | 0.2% |
| 4 | 4 | 0.1% |
| pay_6 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1879 | |
| - | 1393 | |
| 2 | 930 | |
| 1 | 810 | |
| 3 | 30 | 0.6% |
| 6 | 10 | 0.2% |
| 7 | 6 | 0.1% |
| 4 | 4 | 0.1% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Other values (3) | 5 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3670 | |
| Dash Punctuation | 1393 | 27.5% |
| Uppercase Letter | 6 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1879 | |
| 2 | 930 | |
| 1 | 810 | |
| 3 | 30 | 0.8% |
| 6 | 10 | 0.3% |
| 7 | 6 | 0.2% |
| 4 | 4 | 0.1% |
| 8 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1393 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5065 | |
| Latin | 6 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1879 | |
| - | 1393 | |
| 2 | 930 | |
| 1 | 810 | |
| 3 | 30 | 0.6% |
| 6 | 10 | 0.2% |
| 7 | 6 | 0.1% |
| 4 | 4 | 0.1% |
| _ | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | |
| A | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5071 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1879 | |
| - | 1393 | |
| 2 | 930 | |
| 1 | 810 | |
| 3 | 30 | 0.6% |
| 6 | 10 | 0.2% |
| 7 | 6 | 0.1% |
| 4 | 4 | 0.1% |
| P | 2 | < 0.1% |
| A | 2 | < 0.1% |
| Other values (3) | 5 | 0.1% |
X12
Text
| Distinct | 2138 |
|---|---|
| Distinct (%) | 58.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 4.5122616 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16560 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 980 |
|---|---|
| Unique (%) | 26.7% |
Sample
| 1st row | BILL_AMT1 |
|---|---|
| 2nd row | 3913 |
| 3rd row | 2682 |
| 4th row | 29239 |
| 5th row | 46990 |
Words
| Value | Count | Frequency (%) |
| 0 | 244 | 6.6% |
| 390 | 31 | 0.8% |
| 780 | 12 | 0.3% |
| 316 | 11 | 0.3% |
| 396 | 10 | 0.3% |
| 200 | 8 | 0.2% |
| 2400 | 7 | 0.2% |
| 291 | 7 | 0.2% |
| 2000 | 7 | 0.2% |
| 819 | 6 | 0.2% |
| Other values (2120) | 3327 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2375 | |
| 0 | 1894 | |
| 2 | 1794 | |
| 4 | 1678 | |
| 3 | 1595 | |
| 5 | 1454 | |
| 6 | 1449 | |
| 8 | 1414 | |
| 9 | 1406 | |
| 7 | 1402 | |
| Other values (8) | 99 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16461 | |
| Dash Punctuation | 83 | 0.5% |
| Uppercase Letter | 14 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2375 | |
| 0 | 1894 | |
| 2 | 1794 | |
| 4 | 1678 | |
| 3 | 1595 | |
| 5 | 1454 | |
| 6 | 1449 | |
| 8 | 1414 | |
| 9 | 1406 | |
| 7 | 1402 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 83 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16546 | |
| Latin | 14 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2375 | |
| 0 | 1894 | |
| 2 | 1794 | |
| 4 | 1678 | |
| 3 | 1595 | |
| 5 | 1454 | |
| 6 | 1449 | |
| 8 | 1414 | |
| 9 | 1406 | |
| 7 | 1402 | |
| Other values (2) | 85 | 0.5% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2375 | |
| 0 | 1894 | |
| 2 | 1794 | |
| 4 | 1678 | |
| 3 | 1595 | |
| 5 | 1454 | |
| 6 | 1449 | |
| 8 | 1414 | |
| 9 | 1406 | |
| 7 | 1402 | |
| Other values (8) | 99 | 0.6% |
X13
Text
| Distinct | 2092 |
|---|---|
| Distinct (%) | 57.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 4.439782 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16294 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 971 ? |
|---|---|
| Unique (%) | 26.5% |
Sample
| 1st row | BILL_AMT2 |
|---|---|
| 2nd row | 3102 |
| 3rd row | 1725 |
| 4th row | 14027 |
| 5th row | 48233 |
| Value | Count | Frequency (%) |
| 0 | 328 | 8.9% |
| 390 | 17 | 0.5% |
| 200 | 14 | 0.4% |
| 316 | 13 | 0.4% |
| 291 | 10 | 0.3% |
| 780 | 9 | 0.2% |
| 326 | 8 | 0.2% |
| 300 | 8 | 0.2% |
| 396 | 7 | 0.2% |
| 2400 | 7 | 0.2% |
| Other values (2078) | 3249 | 88.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2260 | |
| 0 | 1965 | |
| 2 | 1853 | |
| 3 | 1610 | |
| 5 | 1483 | |
| 6 | 1469 | |
| 4 | 1461 | |
| 9 | 1365 | |
| 7 | 1364 | |
| 8 | 1356 | |
| Other values (8) | 108 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16186 | |
| Dash Punctuation | 92 | 0.6% |
| Uppercase Letter | 14 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2260 | |
| 0 | 1965 | |
| 2 | 1853 | |
| 3 | 1610 | |
| 5 | 1483 | |
| 6 | 1469 | |
| 4 | 1461 | |
| 9 | 1365 | |
| 7 | 1364 | |
| 8 | 1356 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 92 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16280 | |
| Latin | 14 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2260 | |
| 0 | 1965 | |
| 2 | 1853 | |
| 3 | 1610 | |
| 5 | 1483 | |
| 6 | 1469 | |
| 4 | 1461 | |
| 9 | 1365 | |
| 7 | 1364 | |
| 8 | 1356 | |
| Other values (2) | 94 | 0.6% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16294 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2260 | |
| 0 | 1965 | |
| 2 | 1853 | |
| 3 | 1610 | |
| 5 | 1483 | |
| 6 | 1469 | |
| 4 | 1461 | |
| 9 | 1365 | |
| 7 | 1364 | |
| 8 | 1356 | |
| Other values (8) | 108 | 0.7% |
X14
Text
| Distinct | 2045 |
|---|---|
| Distinct (%) | 55.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 4.3613079 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16006 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 939 ? |
|---|---|
| Unique (%) | 25.6% |
Sample
| 1st row | BILL_AMT3 |
|---|---|
| 2nd row | 689 |
| 3rd row | 2682 |
| 4th row | 13559 |
| 5th row | 49291 |
| Value | Count | Frequency (%) |
| 0 | 384 | 10.5% |
| 390 | 31 | 0.8% |
| 200 | 12 | 0.3% |
| 780 | 11 | 0.3% |
| 2 | 9 | 0.2% |
| 291 | 9 | 0.2% |
| 316 | 8 | 0.2% |
| 396 | 7 | 0.2% |
| 2400 | 7 | 0.2% |
| 326 | 6 | 0.2% |
| Other values (2031) | 3186 | 86.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2274 | |
| 0 | 1982 | |
| 2 | 1881 | |
| 3 | 1610 | |
| 4 | 1415 | |
| 6 | 1394 | |
| 9 | 1379 | |
| 5 | 1359 | |
| 8 | 1350 | |
| 7 | 1264 | |
| Other values (8) | 98 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15908 | |
| Dash Punctuation | 82 | 0.5% |
| Uppercase Letter | 14 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2274 | |
| 0 | 1982 | |
| 2 | 1881 | |
| 3 | 1610 | |
| 4 | 1415 | |
| 6 | 1394 | |
| 9 | 1379 | |
| 5 | 1359 | |
| 8 | 1350 | |
| 7 | 1264 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 82 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15992 | |
| Latin | 14 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2274 | |
| 0 | 1982 | |
| 2 | 1881 | |
| 3 | 1610 | |
| 4 | 1415 | |
| 6 | 1394 | |
| 9 | 1379 | |
| 5 | 1359 | |
| 8 | 1350 | |
| 7 | 1264 | |
| Other values (2) | 84 | 0.5% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16006 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2274 | |
| 0 | 1982 | |
| 2 | 1881 | |
| 3 | 1610 | |
| 4 | 1415 | |
| 6 | 1394 | |
| 9 | 1379 | |
| 5 | 1359 | |
| 8 | 1350 | |
| 7 | 1264 | |
| Other values (8) | 98 | 0.6% |
X15
Text
| Distinct | 2009 |
|---|---|
| Distinct (%) | 54.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 4.2942779 |
| Min length | 1 |
Characters and Unicode
| Total characters | 15760 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 916 ? |
|---|---|
| Unique (%) | 25.0% |
Sample
| 1st row | BILL_AMT4 |
|---|---|
| 2nd row | 0 |
| 3rd row | 3272 |
| 4th row | 14331 |
| 5th row | 28314 |
| Value | Count | Frequency (%) |
| 0 | 424 | 11.6% |
| 390 | 25 | 0.7% |
| 316 | 15 | 0.4% |
| 291 | 10 | 0.3% |
| 326 | 9 | 0.2% |
| 300 | 8 | 0.2% |
| 150 | 8 | 0.2% |
| 2400 | 7 | 0.2% |
| 416 | 7 | 0.2% |
| 780 | 7 | 0.2% |
| Other values (1993) | 3150 | 85.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2295 | |
| 0 | 1960 | |
| 2 | 1731 | |
| 3 | 1514 | |
| 4 | 1444 | |
| 6 | 1388 | |
| 8 | 1370 | |
| 5 | 1353 | |
| 9 | 1326 | |
| 7 | 1279 | |
| Other values (8) | 100 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15660 | |
| Dash Punctuation | 84 | 0.5% |
| Uppercase Letter | 14 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2295 | |
| 0 | 1960 | |
| 2 | 1731 | |
| 3 | 1514 | |
| 4 | 1444 | |
| 6 | 1388 | |
| 8 | 1370 | |
| 5 | 1353 | |
| 9 | 1326 | |
| 7 | 1279 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 84 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15746 | |
| Latin | 14 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2295 | |
| 0 | 1960 | |
| 2 | 1731 | |
| 3 | 1514 | |
| 4 | 1444 | |
| 6 | 1388 | |
| 8 | 1370 | |
| 5 | 1353 | |
| 9 | 1326 | |
| 7 | 1279 | |
| Other values (2) | 86 | 0.5% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2295 | |
| 0 | 1960 | |
| 2 | 1731 | |
| 3 | 1514 | |
| 4 | 1444 | |
| 6 | 1388 | |
| 8 | 1370 | |
| 5 | 1353 | |
| 9 | 1326 | |
| 7 | 1279 | |
| Other values (8) | 100 | 0.6% |
X16
Text
| Distinct | 1984 |
|---|---|
| Distinct (%) | 54.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 4.2564033 |
| Min length | 1 |
Characters and Unicode
| Total characters | 15621 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 915 ? |
|---|---|
| Unique (%) | 24.9% |
Sample
| 1st row | BILL_AMT5 |
|---|---|
| 2nd row | 0 |
| 3rd row | 3455 |
| 4th row | 14948 |
| 5th row | 28959 |
| Value | Count | Frequency (%) |
| 0 | 460 | 12.5% |
| 390 | 28 | 0.8% |
| 150 | 13 | 0.4% |
| 396 | 11 | 0.3% |
| 316 | 11 | 0.3% |
| 2000 | 8 | 0.2% |
| 780 | 7 | 0.2% |
| 416 | 7 | 0.2% |
| 2400 | 7 | 0.2% |
| 1261 | 6 | 0.2% |
| Other values (1967) | 3112 | 84.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2224 | |
| 0 | 2058 | |
| 2 | 1660 | |
| 3 | 1562 | |
| 9 | 1464 | |
| 5 | 1366 | |
| 4 | 1324 | |
| 6 | 1320 | |
| 8 | 1301 | |
| 7 | 1239 | |
| Other values (8) | 103 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15518 | |
| Dash Punctuation | 87 | 0.6% |
| Uppercase Letter | 14 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2224 | |
| 0 | 2058 | |
| 2 | 1660 | |
| 3 | 1562 | |
| 9 | 1464 | |
| 5 | 1366 | |
| 4 | 1324 | |
| 6 | 1320 | |
| 8 | 1301 | |
| 7 | 1239 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15607 | |
| Latin | 14 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2224 | |
| 0 | 2058 | |
| 2 | 1660 | |
| 3 | 1562 | |
| 9 | 1464 | |
| 5 | 1366 | |
| 4 | 1324 | |
| 6 | 1320 | |
| 8 | 1301 | |
| 7 | 1239 | |
| Other values (2) | 89 | 0.6% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15621 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2224 | |
| 0 | 2058 | |
| 2 | 1660 | |
| 3 | 1562 | |
| 9 | 1464 | |
| 5 | 1366 | |
| 4 | 1324 | |
| 6 | 1320 | |
| 8 | 1301 | |
| 7 | 1239 | |
| Other values (8) | 103 | 0.7% |
X17
Text
| Distinct | 1948 |
|---|---|
| Distinct (%) | 53.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 4.1618529 |
| Min length | 1 |
Characters and Unicode
| Total characters | 15274 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 903 ? |
|---|---|
| Unique (%) | 24.6% |
Sample
| 1st row | BILL_AMT6 |
|---|---|
| 2nd row | 0 |
| 3rd row | 3261 |
| 4th row | 15549 |
| 5th row | 29547 |
| Value | Count | Frequency (%) |
| 0 | 532 | 14.5% |
| 390 | 22 | 0.6% |
| 780 | 18 | 0.5% |
| 150 | 17 | 0.5% |
| 316 | 12 | 0.3% |
| 326 | 11 | 0.3% |
| 291 | 11 | 0.3% |
| 200 | 8 | 0.2% |
| 396 | 7 | 0.2% |
| 1320 | 6 | 0.2% |
| Other values (1932) | 3026 | 82.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2226 | |
| 0 | 1981 | |
| 2 | 1593 | |
| 3 | 1555 | |
| 9 | 1385 | |
| 4 | 1365 | |
| 5 | 1354 | |
| 6 | 1305 | |
| 8 | 1292 | |
| 7 | 1131 | |
| Other values (8) | 87 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15187 | |
| Dash Punctuation | 71 | 0.5% |
| Uppercase Letter | 14 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2226 | |
| 0 | 1981 | |
| 2 | 1593 | |
| 3 | 1555 | |
| 9 | 1385 | |
| 4 | 1365 | |
| 5 | 1354 | |
| 6 | 1305 | |
| 8 | 1292 | |
| 7 | 1131 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 71 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15260 | |
| Latin | 14 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2226 | |
| 0 | 1981 | |
| 2 | 1593 | |
| 3 | 1555 | |
| 9 | 1385 | |
| 4 | 1365 | |
| 5 | 1354 | |
| 6 | 1305 | |
| 8 | 1292 | |
| 7 | 1131 | |
| Other values (2) | 73 | 0.5% |
Latin
| Value | Count | Frequency (%) |
| L | 4 | |
| B | 2 | |
| I | 2 | |
| A | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15274 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2226 | |
| 0 | 1981 | |
| 2 | 1593 | |
| 3 | 1555 | |
| 9 | 1385 | |
| 4 | 1365 | |
| 5 | 1354 | |
| 6 | 1305 | |
| 8 | 1292 | |
| 7 | 1131 | |
| Other values (8) | 87 | 0.6% |
X18
Text
| Distinct | 1147 |
|---|---|
| Distinct (%) | 31.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.5160763 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12904 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 478 ? |
|---|---|
| Unique (%) | 13.0% |
Sample
| 1st row | PAY_AMT1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1518 |
| 5th row | 2000 |
| Value | Count | Frequency (%) |
| 0 | 667 | 18.2% |
| 2000 | 147 | 4.0% |
| 3000 | 102 | 2.8% |
| 5000 | 76 | 2.1% |
| 10000 | 64 | 1.7% |
| 1000 | 62 | 1.7% |
| 2500 | 60 | 1.6% |
| 1500 | 55 | 1.5% |
| 4000 | 47 | 1.3% |
| 1600 | 30 | 0.8% |
| Other values (1137) | 2360 | 64.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4789 | |
| 1 | 1670 | 12.9% |
| 2 | 1266 | 9.8% |
| 3 | 1022 | 7.9% |
| 5 | 955 | 7.4% |
| 4 | 769 | 6.0% |
| 6 | 758 | 5.9% |
| 7 | 626 | 4.9% |
| 8 | 580 | 4.5% |
| 9 | 455 | 3.5% |
| Other values (6) | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12890 | |
| Uppercase Letter | 12 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4789 | |
| 1 | 1670 | 13.0% |
| 2 | 1266 | 9.8% |
| 3 | 1022 | 7.9% |
| 5 | 955 | 7.4% |
| 4 | 769 | 6.0% |
| 6 | 758 | 5.9% |
| 7 | 626 | 4.9% |
| 8 | 580 | 4.5% |
| 9 | 455 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12892 | |
| Latin | 12 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4789 | |
| 1 | 1670 | 13.0% |
| 2 | 1266 | 9.8% |
| 3 | 1022 | 7.9% |
| 5 | 955 | 7.4% |
| 4 | 769 | 6.0% |
| 6 | 758 | 5.9% |
| 7 | 626 | 4.9% |
| 8 | 580 | 4.5% |
| 9 | 455 | 3.5% |
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12904 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4789 | |
| 1 | 1670 | 12.9% |
| 2 | 1266 | 9.8% |
| 3 | 1022 | 7.9% |
| 5 | 955 | 7.4% |
| 4 | 769 | 6.0% |
| 6 | 758 | 5.9% |
| 7 | 626 | 4.9% |
| 8 | 580 | 4.5% |
| 9 | 455 | 3.5% |
| Other values (6) | 14 | 0.1% |
X19
Text
| Distinct | 1130 |
|---|---|
| Distinct (%) | 30.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.4351499 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12607 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 468 ? |
|---|---|
| Unique (%) | 12.8% |
Sample
| 1st row | PAY_AMT2 |
|---|---|
| 2nd row | 689 |
| 3rd row | 1000 |
| 4th row | 1500 |
| 5th row | 2019 |
| Value | Count | Frequency (%) |
| 0 | 708 | 19.3% |
| 2000 | 146 | 4.0% |
| 1000 | 95 | 2.6% |
| 5000 | 95 | 2.6% |
| 3000 | 94 | 2.6% |
| 1500 | 89 | 2.4% |
| 1200 | 39 | 1.1% |
| 4000 | 38 | 1.0% |
| 1400 | 34 | 0.9% |
| 390 | 34 | 0.9% |
| Other values (1120) | 2298 | 62.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4635 | |
| 1 | 1730 | 13.7% |
| 2 | 1224 | 9.7% |
| 3 | 1009 | 8.0% |
| 5 | 968 | 7.7% |
| 4 | 712 | 5.6% |
| 6 | 640 | 5.1% |
| 7 | 591 | 4.7% |
| 9 | 567 | 4.5% |
| 8 | 517 | 4.1% |
| Other values (6) | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12593 | |
| Uppercase Letter | 12 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4635 | |
| 1 | 1730 | 13.7% |
| 2 | 1224 | 9.7% |
| 3 | 1009 | 8.0% |
| 5 | 968 | 7.7% |
| 4 | 712 | 5.7% |
| 6 | 640 | 5.1% |
| 7 | 591 | 4.7% |
| 9 | 567 | 4.5% |
| 8 | 517 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12595 | |
| Latin | 12 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4635 | |
| 1 | 1730 | 13.7% |
| 2 | 1224 | 9.7% |
| 3 | 1009 | 8.0% |
| 5 | 968 | 7.7% |
| 4 | 712 | 5.7% |
| 6 | 640 | 5.1% |
| 7 | 591 | 4.7% |
| 9 | 567 | 4.5% |
| 8 | 517 | 4.1% |
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12607 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4635 | |
| 1 | 1730 | 13.7% |
| 2 | 1224 | 9.7% |
| 3 | 1009 | 8.0% |
| 5 | 968 | 7.7% |
| 4 | 712 | 5.6% |
| 6 | 640 | 5.1% |
| 7 | 591 | 4.7% |
| 9 | 567 | 4.5% |
| 8 | 517 | 4.1% |
| Other values (6) | 14 | 0.1% |
X20
Text
| Distinct | 1041 |
|---|---|
| Distinct (%) | 28.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.2615804 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11970 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 395 ? |
|---|---|
| Unique (%) | 10.8% |
Sample
| 1st row | PAY_AMT3 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1000 |
| 4th row | 1000 |
| 5th row | 1200 |
| Value | Count | Frequency (%) |
| 0 | 798 | 21.7% |
| 1000 | 172 | 4.7% |
| 2000 | 152 | 4.1% |
| 3000 | 100 | 2.7% |
| 5000 | 82 | 2.2% |
| 1500 | 55 | 1.5% |
| 4000 | 48 | 1.3% |
| 10000 | 44 | 1.2% |
| 2500 | 29 | 0.8% |
| 6000 | 29 | 0.8% |
| Other values (1031) | 2161 | 58.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4835 | |
| 1 | 1492 | 12.5% |
| 2 | 1036 | 8.7% |
| 3 | 888 | 7.4% |
| 5 | 883 | 7.4% |
| 6 | 694 | 5.8% |
| 4 | 596 | 5.0% |
| 7 | 538 | 4.5% |
| 8 | 516 | 4.3% |
| 9 | 478 | 4.0% |
| Other values (6) | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11956 | |
| Uppercase Letter | 12 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4835 | |
| 1 | 1492 | 12.5% |
| 2 | 1036 | 8.7% |
| 3 | 888 | 7.4% |
| 5 | 883 | 7.4% |
| 6 | 694 | 5.8% |
| 4 | 596 | 5.0% |
| 7 | 538 | 4.5% |
| 8 | 516 | 4.3% |
| 9 | 478 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11958 | |
| Latin | 12 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4835 | |
| 1 | 1492 | 12.5% |
| 2 | 1036 | 8.7% |
| 3 | 888 | 7.4% |
| 5 | 883 | 7.4% |
| 6 | 694 | 5.8% |
| 4 | 596 | 5.0% |
| 7 | 538 | 4.5% |
| 8 | 516 | 4.3% |
| 9 | 478 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4835 | |
| 1 | 1492 | 12.5% |
| 2 | 1036 | 8.7% |
| 3 | 888 | 7.4% |
| 5 | 883 | 7.4% |
| 6 | 694 | 5.8% |
| 4 | 596 | 5.0% |
| 7 | 538 | 4.5% |
| 8 | 516 | 4.3% |
| 9 | 478 | 4.0% |
| Other values (6) | 14 | 0.1% |
X21
Text
| Distinct | 1035 |
|---|---|
| Distinct (%) | 28.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.2569482 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11953 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 416 ? |
|---|---|
| Unique (%) | 11.3% |
Sample
| 1st row | PAY_AMT4 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1000 |
| 4th row | 1000 |
| 5th row | 1100 |
| Value | Count | Frequency (%) |
| 0 | 808 | 22.0% |
| 1000 | 160 | 4.4% |
| 2000 | 142 | 3.9% |
| 3000 | 94 | 2.6% |
| 5000 | 91 | 2.5% |
| 1500 | 71 | 1.9% |
| 4000 | 49 | 1.3% |
| 500 | 41 | 1.1% |
| 2500 | 37 | 1.0% |
| 10000 | 36 | 1.0% |
| Other values (1025) | 2141 | 58.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4848 | |
| 1 | 1406 | 11.8% |
| 2 | 994 | 8.3% |
| 5 | 934 | 7.8% |
| 3 | 857 | 7.2% |
| 4 | 631 | 5.3% |
| 6 | 631 | 5.3% |
| 7 | 567 | 4.7% |
| 9 | 554 | 4.6% |
| 8 | 517 | 4.3% |
| Other values (6) | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11939 | |
| Uppercase Letter | 12 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4848 | |
| 1 | 1406 | 11.8% |
| 2 | 994 | 8.3% |
| 5 | 934 | 7.8% |
| 3 | 857 | 7.2% |
| 4 | 631 | 5.3% |
| 6 | 631 | 5.3% |
| 7 | 567 | 4.7% |
| 9 | 554 | 4.6% |
| 8 | 517 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11941 | |
| Latin | 12 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4848 | |
| 1 | 1406 | 11.8% |
| 2 | 994 | 8.3% |
| 5 | 934 | 7.8% |
| 3 | 857 | 7.2% |
| 4 | 631 | 5.3% |
| 6 | 631 | 5.3% |
| 7 | 567 | 4.7% |
| 9 | 554 | 4.6% |
| 8 | 517 | 4.3% |
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4848 | |
| 1 | 1406 | 11.8% |
| 2 | 994 | 8.3% |
| 5 | 934 | 7.8% |
| 3 | 857 | 7.2% |
| 4 | 631 | 5.3% |
| 6 | 631 | 5.3% |
| 7 | 567 | 4.7% |
| 9 | 554 | 4.6% |
| 8 | 517 | 4.3% |
| Other values (6) | 14 | 0.1% |
X22
Text
| Distinct | 1039 |
|---|---|
| Distinct (%) | 28.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.2517711 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11934 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 420 ? |
|---|---|
| Unique (%) | 11.4% |
Sample
| 1st row | PAY_AMT5 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1000 |
| 5th row | 1069 |
| Value | Count | Frequency (%) |
| 0 | 827 | 22.5% |
| 1000 | 163 | 4.4% |
| 2000 | 123 | 3.4% |
| 3000 | 119 | 3.2% |
| 5000 | 77 | 2.1% |
| 1500 | 73 | 2.0% |
| 4000 | 48 | 1.3% |
| 2500 | 33 | 0.9% |
| 500 | 28 | 0.8% |
| 3500 | 27 | 0.7% |
| Other values (1029) | 2152 | 58.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4740 | |
| 1 | 1406 | 11.8% |
| 2 | 1003 | 8.4% |
| 3 | 938 | 7.9% |
| 5 | 922 | 7.7% |
| 4 | 674 | 5.6% |
| 6 | 637 | 5.3% |
| 7 | 557 | 4.7% |
| 8 | 526 | 4.4% |
| 9 | 517 | 4.3% |
| Other values (6) | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11920 | |
| Uppercase Letter | 12 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4740 | |
| 1 | 1406 | 11.8% |
| 2 | 1003 | 8.4% |
| 3 | 938 | 7.9% |
| 5 | 922 | 7.7% |
| 4 | 674 | 5.7% |
| 6 | 637 | 5.3% |
| 7 | 557 | 4.7% |
| 8 | 526 | 4.4% |
| 9 | 517 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11922 | |
| Latin | 12 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4740 | |
| 1 | 1406 | 11.8% |
| 2 | 1003 | 8.4% |
| 3 | 938 | 7.9% |
| 5 | 922 | 7.7% |
| 4 | 674 | 5.7% |
| 6 | 637 | 5.3% |
| 7 | 557 | 4.7% |
| 8 | 526 | 4.4% |
| 9 | 517 | 4.3% |
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4740 | |
| 1 | 1406 | 11.8% |
| 2 | 1003 | 8.4% |
| 3 | 938 | 7.9% |
| 5 | 922 | 7.7% |
| 4 | 674 | 5.6% |
| 6 | 637 | 5.3% |
| 7 | 557 | 4.7% |
| 8 | 526 | 4.4% |
| 9 | 517 | 4.3% |
| Other values (6) | 14 | 0.1% |
X23
Text
| Distinct | 971 |
|---|---|
| Distinct (%) | 26.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 3.1525886 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11570 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 400 ? |
|---|---|
| Unique (%) | 10.9% |
Sample
| 1st row | PAY_AMT6 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2000 |
| 4th row | 5000 |
| 5th row | 1000 |
| Value | Count | Frequency (%) |
| 0 | 949 | 25.9% |
| 1000 | 176 | 4.8% |
| 2000 | 163 | 4.4% |
| 5000 | 96 | 2.6% |
| 3000 | 94 | 2.6% |
| 1500 | 62 | 1.7% |
| 4000 | 57 | 1.6% |
| 10000 | 39 | 1.1% |
| 2500 | 37 | 1.0% |
| 6000 | 28 | 0.8% |
| Other values (961) | 1969 | 53.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4906 | |
| 1 | 1279 | 11.1% |
| 2 | 988 | 8.5% |
| 5 | 867 | 7.5% |
| 3 | 793 | 6.9% |
| 6 | 622 | 5.4% |
| 4 | 621 | 5.4% |
| 7 | 533 | 4.6% |
| 8 | 474 | 4.1% |
| 9 | 473 | 4.1% |
| Other values (6) | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11556 | |
| Uppercase Letter | 12 | 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4906 | |
| 1 | 1279 | 11.1% |
| 2 | 988 | 8.5% |
| 5 | 867 | 7.5% |
| 3 | 793 | 6.9% |
| 6 | 622 | 5.4% |
| 4 | 621 | 5.4% |
| 7 | 533 | 4.6% |
| 8 | 474 | 4.1% |
| 9 | 473 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11558 | |
| Latin | 12 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4906 | |
| 1 | 1279 | 11.1% |
| 2 | 988 | 8.5% |
| 5 | 867 | 7.5% |
| 3 | 793 | 6.9% |
| 6 | 622 | 5.4% |
| 4 | 621 | 5.4% |
| 7 | 533 | 4.6% |
| 8 | 474 | 4.1% |
| 9 | 473 | 4.1% |
Latin
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 2 | |
| Y | 2 | |
| M | 2 | |
| T | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11570 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4906 | |
| 1 | 1279 | 11.1% |
| 2 | 988 | 8.5% |
| 5 | 867 | 7.5% |
| 3 | 793 | 6.9% |
| 6 | 622 | 5.4% |
| 4 | 621 | 5.4% |
| 7 | 533 | 4.6% |
| 8 | 474 | 4.1% |
| 9 | 473 | 4.1% |
| Other values (6) | 14 | 0.1% |
Y
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| Value | Count |
|---|---|
| not default | 2873 |
| default | 795 |
| default payment next month | 2 |
Length
| Max length | 26 |
|---|---|
| Median length | 11 |
| Mean length | 10.141689 |
| Min length | 7 |
Characters and Unicode
| Total characters | 37220 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | default payment next month |
|---|---|
| 2nd row | default |
| 3rd row | default |
| 4th row | not default |
| 5th row | not default |
Common Values
| Value | Count | Frequency (%) |
| not default | 2873 | 78.3% |
| default | 795 | 21.7% |
| default payment next month | 2 | 0.1% |
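The 52.0% imbalance figure flagged for Y can be reproduced from the three counts above. A minimal sketch, assuming the score is one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy log2(k); `imbalance_score` is a hypothetical name, not a library function:

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a perfectly balanced variable,
    approaching 1 when a single class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in counts if c > 0
    )
    return 1.0 - entropy / math.log2(k)


# Counts from the Common Values table: not default, default, header string.
score = imbalance_score([2873, 795, 2])
print(f"{score:.1%}")
```

The two stray `default payment next month` entries count as a third class here. Under this formula, dropping them would change the score, since the normalization would then use log2(2) instead of log2(3).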
Words
| Value | Count | Frequency (%) |
| default | 3670 | 56.0% |
| not | 2873 | 43.9% |
| payment | 2 | < 0.1% |
| next | 2 | < 0.1% |
| month | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6549 | |
| e | 3674 | |
| a | 3672 | |
| d | 3670 | |
| f | 3670 | |
| u | 3670 | |
| l | 3670 | |
| n | 2879 | |
| (space) | 2879 | 7.7% |
| o | 2875 | |
| Other values (5) | 12 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34341 | 92.3% |
| Space Separator | 2879 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 6549 | |
| e | 3674 | |
| a | 3672 | |
| d | 3670 | |
| f | 3670 | |
| u | 3670 | |
| l | 3670 | |
| n | 2879 | |
| o | 2875 | |
| m | 4 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2879 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34341 | 92.3% |
| Common | 2879 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 6549 | |
| e | 3674 | |
| a | 3672 | |
| d | 3670 | |
| f | 3670 | |
| u | 3670 | |
| l | 3670 | |
| n | 2879 | |
| o | 2875 | |
| m | 4 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 2879 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37220 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 6549 | |
| e | 3674 | |
| a | 3672 | |
| d | 3670 | |
| f | 3670 | |
| u | 3670 | |
| l | 3670 | |
| n | 2879 | |
| (space) | 2879 | 7.7% |
| o | 2875 | |
| Other values (5) | 12 | < 0.1% |
| | X10 | X11 | X2 | X3 | X4 | X6 | X7 | X8 | X9 | Y |
|---|---|---|---|---|---|---|---|---|---|---|
| X10 | 1.000 | 0.720 | 0.708 | 0.506 | 0.501 | 0.538 | 0.582 | 0.701 | 0.770 | 0.729 |
| X11 | 0.720 | 1.000 | 0.708 | 0.508 | 0.500 | 0.476 | 0.497 | 0.599 | 0.709 | 0.718 |
| X2 | 0.708 | 0.708 | 1.000 | 0.708 | 0.708 | 0.707 | 0.708 | 0.708 | 0.708 | 0.707 |
| X3 | 0.506 | 0.508 | 0.708 | 1.000 | 0.512 | 0.508 | 0.511 | 0.510 | 0.508 | 0.708 |
| X4 | 0.501 | 0.500 | 0.708 | 0.512 | 1.000 | 0.501 | 0.501 | 0.501 | 0.501 | 0.707 |
| X6 | 0.538 | 0.476 | 0.707 | 0.508 | 0.501 | 1.000 | 0.710 | 0.608 | 0.555 | 0.755 |
| X7 | 0.582 | 0.497 | 0.708 | 0.511 | 0.501 | 0.710 | 1.000 | 0.724 | 0.614 | 0.733 |
| X8 | 0.701 | 0.599 | 0.708 | 0.510 | 0.501 | 0.608 | 0.724 | 1.000 | 0.752 | 0.730 |
| X9 | 0.770 | 0.709 | 0.708 | 0.508 | 0.501 | 0.555 | 0.614 | 0.752 | 1.000 | 0.724 |
| Y | 0.729 | 0.718 | 0.707 | 0.708 | 0.707 | 0.755 | 0.733 | 0.730 | 0.724 | 1.000 |
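The report does not state which association measure produced the matrix above. For pairs of categorical columns, one common choice is Cramér's V, so the sketch below shows that measure under the assumption that it is a reasonable stand-in; the function name `cramers_v` is hypothetical, and this version applies no bias correction, so its values may differ from the ones reported.

```python
import numpy as np
import pandas as pd


def cramers_v(x: pd.Series, y: pd.Series) -> float:
    """Uncorrected Cramér's V between two categorical series (illustrative)."""
    # Contingency table of observed counts
    observed = pd.crosstab(x, y).to_numpy(dtype=float)
    n = observed.sum()
    # Expected counts under independence: row total * column total / n
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / n
    chi2 = ((observed - expected) ** 2 / expected).sum()
    k = min(observed.shape) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return float(np.sqrt(chi2 / (n * k)))


a = pd.Series(["u", "u", "v", "v"])
b = pd.Series(["p", "p", "q", "q"])
print(cramers_v(a, b))
```

The measure ranges from 0 (no association) to 1 (one column fully determines the other) and is symmetric, which matches the symmetric layout of the table.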
| | X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | X18 | X19 | X20 | X21 | X22 | X23 | Y |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | default payment next month |
| 1 | 20000 | female | university | 1 | 24 | 2 | 2 | -1 | -1 | -2 | -2 | 3913 | 3102 | 689 | 0 | 0 | 0 | 0 | 689 | 0 | 0 | 0 | 0 | default |
| 2 | 120000 | female | university | 2 | 26 | -1 | 2 | 0 | 0 | 0 | 2 | 2682 | 1725 | 2682 | 3272 | 3455 | 3261 | 0 | 1000 | 1000 | 1000 | 0 | 2000 | default |
| 3 | 90000 | female | university | 2 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 29239 | 14027 | 13559 | 14331 | 14948 | 15549 | 1518 | 1500 | 1000 | 1000 | 1000 | 5000 | not default |
| 4 | 50000 | female | university | 1 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 46990 | 48233 | 49291 | 28314 | 28959 | 29547 | 2000 | 2019 | 1200 | 1100 | 1069 | 1000 | not default |
| 5 | 50000 | male | university | 1 | 57 | -1 | 0 | -1 | 0 | 0 | 0 | 8617 | 5670 | 35835 | 20940 | 19146 | 19131 | 2000 | 36681 | 10000 | 9000 | 689 | 679 | not default |
| 6 | 50000 | male | graduate school | 2 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 64400 | 57069 | 57608 | 19394 | 19619 | 20024 | 2500 | 1815 | 657 | 1000 | 1000 | 800 | not default |
| 7 | 500000 | male | graduate school | 2 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 367965 | 412023 | 445007 | 542653 | 483003 | 473944 | 55000 | 40000 | 38000 | 20239 | 13750 | 13770 | not default |
| 8 | 100000 | female | university | 2 | 23 | 0 | -1 | -1 | 0 | 0 | -1 | 11876 | 380 | 601 | 221 | -159 | 567 | 380 | 601 | 0 | 581 | 1687 | 1542 | not default |
| 9 | 140000 | female | high school | 1 | 28 | 0 | 0 | 2 | 0 | 0 | 0 | 11285 | 14096 | 12108 | 12211 | 11793 | 3719 | 3329 | 0 | 432 | 1000 | 1000 | 1000 | not default |
| | X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | X18 | X19 | X20 | X21 | X22 | X23 | Y |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3660 | 380000 | male | university | 1 | 50 | 0 | 0 | 0 | 0 | 0 | 0 | 385662 | 294826 | 220022 | 154283 | 35270 | 332270 | 12020 | 9009 | 6109 | 3000 | 332000 | 12000 | default |
| 3661 | 50000 | male | university | 1 | 44 | 0 | 0 | 0 | 0 | 0 | 0 | 45335 | 46027 | 30286 | 26275 | 26823 | 27371 | 1524 | 1427 | 941 | 972 | 992 | 1000 | not default |
| 3662 | 150000 | female | high school | 1 | 43 | -1 | -1 | 2 | 0 | -1 | -1 | 264 | 948 | 632 | 316 | 316 | 1414 | 1000 | 0 | 0 | 316 | 1414 | 0 | default |
| 3663 | 220000 | male | university | 2 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 122286 | 122839 | 123035 | 114385 | 115903 | 118528 | 5008 | 5007 | 6007 | 5000 | 4700 | 5503 | not default |
| 3664 | 80000 | female | other | 2 | 27 | 0 | 0 | 0 | 0 | 0 | 0 | 45268 | 47140 | 47411 | 48443 | 49478 | 43264 | 2600 | 1800 | 1700 | 1700 | 1700 | 1300 | not default |
| 3665 | 220000 | female | university | 1 | 32 | 0 | 0 | 0 | 0 | 0 | 0 | 194961 | 197536 | 203251 | 208355 | 213015 | 217475 | 7200 | 9000 | 10000 | 8000 | 8010 | 8500 | not default |
| 3666 | 70000 | female | university | 2 | 34 | 1 | 2 | 2 | 2 | 0 | 0 | 24208 | 25015 | 27189 | 26456 | 28361 | 31873 | 1500 | 2900 | 0 | 2500 | 4000 | 0 | not default |
| 3667 | 120000 | male | university | 2 | 37 | -1 | 2 | 0 | 0 | 0 | 2 | 16241 | 16680 | 17695 | 17901 | 19608 | 19143 | 1000 | 1600 | 800 | 2000 | 0 | 1600 | default |
| 3668 | 180000 | female | university | 2 | 32 | 0 | 0 | 0 | 0 | 0 | 0 | 20730 | 17107 | 35884 | 31057 | 29052 | 25933 | 1582 | 30000 | 1000 | 1000 | 1000 | 1000 | not default |
| 3669 | 50000 | female | high school | 1 | 57 | 0 | 0 | 0 | 0 | 0 | 0 | 49017 | 50690 | 47487 | 48319 | 48449 | 49656 | 2500 | 2000 | 2000 | 1746 | 2000 | 1800 | not default |
Most frequently occurring
| | X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | X18 | X19 | X20 | X21 | X22 | X23 | Y | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10000 | female | high school | 2 | 22 | 0 | 0 | 0 | 0 | -2 | -2 | 8109 | 9778 | 8259 | 0 | 0 | 0 | 2000 | 1036 | 0 | 0 | 0 | 0 | default | 2 |
| 1 | 10000 | female | university | 1 | 31 | 0 | 0 | 0 | 0 | 0 | 0 | 15915 | 9050 | 9901 | 9975 | 9736 | 8703 | 2330 | 2200 | 1000 | 333 | 311 | 322 | not default | 2 |
| 2 | 10000 | female | university | 2 | 22 | 1 | 2 | 0 | 0 | 0 | 0 | 10250 | 8558 | 10525 | 10050 | 9903 | 9984 | 0 | 2126 | 390 | 328 | 476 | 1287 | not default | 2 |
| 3 | 10000 | male | high school | 2 | 23 | 0 | 0 | 0 | 0 | 0 | 2 | 6974 | 7838 | 9002 | 9182 | 9729 | 9411 | 1134 | 1298 | 478 | 847 | 0 | 175 | not default | 2 |
| 4 | 10000 | male | high school | 2 | 35 | 0 | 0 | 0 | 0 | 0 | 0 | 7877 | 8918 | 9864 | 9673 | 9414 | 9156 | 1174 | 1120 | 310 | 316 | 1000 | 2000 | not default | 2 |
| 5 | 10000 | male | university | 1 | 32 | 1 | 2 | 2 | 2 | 2 | 2 | 8425 | 8148 | 9481 | 9180 | 10052 | 10091 | 0 | 1632 | 0 | 1022 | 350 | 0 | not default | 2 |
| 6 | 10000 | male | university | 1 | 45 | 0 | 0 | 0 | 2 | 0 | 0 | 7139 | 8416 | 9815 | 9508 | 9754 | 10192 | 1400 | 1700 | 0 | 400 | 600 | 200 | default | 2 |
| 7 | 10000 | male | university | 1 | 56 | 2 | 2 | 2 | 0 | 0 | 0 | 2097 | 4193 | 3978 | 4062 | 4196 | 4326 | 2300 | 0 | 150 | 200 | 200 | 160 | default | 2 |
| 8 | 10000 | male | university | 2 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 1877 | 3184 | 6003 | 3576 | 3670 | 4451 | 1500 | 2927 | 1000 | 300 | 1000 | 500 | not default | 2 |
| 9 | 10000 | male | university | 2 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 7960 | 9649 | 8518 | 8628 | 9293 | 5033 | 2000 | 1000 | 500 | 1500 | 0 | 2500 | default | 2 |
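A table like the one above can be approximated directly in pandas. The following is a minimal sketch, assuming the data is available as a DataFrame; the function name `most_frequent_duplicates` is hypothetical, and the profiler's exact counting rules may differ from this grouping.

```python
import pandas as pd


def most_frequent_duplicates(df: pd.DataFrame, top: int = 10) -> pd.DataFrame:
    """Return the rows that occur more than once, with their occurrence counts."""
    cols = list(df.columns)
    # keep=False marks every member of a duplicated group, not just the repeats
    dupes = df[df.duplicated(keep=False)]
    counts = (
        dupes.groupby(cols, dropna=False)
        .size()
        .reset_index(name="# duplicates")
    )
    return counts.nlargest(top, "# duplicates").reset_index(drop=True)


sample = pd.DataFrame(
    {"a": [1, 1, 2, 3, 3, 3], "b": ["x", "x", "y", "z", "z", "z"]}
)
print(most_frequent_duplicates(sample))
```

If the duplicates turn out to be unintended, `df.drop_duplicates()` removes the repeats while keeping one copy of each row.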